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Abstract 

The  Bayesian  approach  has  not  been  very  fruitful  in 
treating  nonparametric  statistical  problems,  due  to  the 
difficulty  in  finding  mathematically  tractable  prior 
distributions  on  a  set  of  probability  measures.  The  theory 
of  the  Dirichlet  process  has  been  developed  recently.  The 
process  generates  randomly  a  family  of  probability 
distributions  which  can  be  taken  as  a  family  of  prior 
distributions  for  the  Bayesian  analysis  of  some 
nonparametric  statistical  problems.  This  paper  deals  with 
the  problem  of  selecting  a  distribution  with  the  largest  pti. 
quantile  value,  from  k  2  2  given  distributions.  It  is 
assumed  i  priori  that  the  given  distributions  have  been 
generated  from  a  Dirichlet  process. 


*  The  work  was  supported  by  the  U.S.  Office  of  Naval 
Research  under  Contract  No.  00014-75-C-0451 . 


1.  Introduction.  The  Bayesian  approach  has  been  very  useful 
in  treating  statistical  problems  of  parametric  nature.  In 
treating  nonparametric  statistical  problems,  the  scope  of 
Bayes  methods  is  limited  due  to  the  difficulty  of  finding 
mathematically  tractable  prior  distributions  on  a  set  of 
probability  measures.  For  practical  application,  the  family 
of  prior  distributions  should  be  such  that  (i)  its  support 
is  large  and  that  (ii)  the  posterior  distribution,  given  a 
sample  drawn  from  the  true  distribution,  is  analytically 
manageable.  In  a  couple  of  papers,  Ferguson  (1973,  1974) 
has  developed  the  theory  of  the  Dirichlet  process.  The 
process  generates  randomly  a  family  of  probability 
distributions  which  can  serve  as  a  family  of  distributions, 
with  the  properties  (i)  and  (ii).  Therefore,  the  process 
can  be  used  in  the  analysis  of  some  statistical  problems  of 
nonparametric  nature,  by  the  Bayes  method. 

In  this  paper  we  consider  a  problem  of  ranking  and 
selection.  Given  a  sample  of  size  n,  drawn  from  each  of  k 
univariate  distributions  Q^,...,Qk,  we  want  to  select  the 
population  associated  with  the  largest  pth  quantile  value. 


which  we  call  the  "best”  population.  A  Bayes  procedure  for 
selecting  the  best  population  is  given.  It  is  assumed  A 
priori  that  the  given  distributions  are  k  independent 
realizations  of  a  Dirichlet  process.  The  given  procedure 
may  be  considered  as  a  Bayes  solution  of  a  nonparametric 
problem.  In  Section  2  we  describe  the  Dirichlet  process  and/ 


its  properties  relating  to  the  selection  problem.  In 
Section  3  we  describe  the  selection  rule  and  derive  a 


formula  for  the  probability  of  a  correct  selection. 

The  problem  of  selecting  the  population  with  the 
largest  pth  quantile  value  arises  in  various  situations  of 
practical  interest.  Gibbons,  Olkin  and  Sobel  (1977)  have 
given  a  number  of  examples  to  illustrate  the  importance  of 
the  problem. 

The  problem  of  selecting  the  population  with  the 
largest  pth  quantile  value,  from  k  populations  and  the 
analogous  problem  of  selecting  a  subset  of  the  k  populations 
which  includes  the  best  population,  have  been  considered  by 
Sobel  (1967),  Rizvi  and  Sobel  (1967)  and  Desu  and  Sobel 
(1971).  The  thrust  of  these  papers  is  to  find  a  "least 
favorable”  configuration  of  the  populations  for  which  the 
probability  of  a  correct  selection  is  minimized  and  to 
determine  a  minimum  sample  size  n  such  that  the  probability 
of  a  correct  election  is  at  least  as  large  as  a  specified 
number  less  than  1.  Rizvi  and  Saxena  (1972)  give  a 
confidence  interval  for  the  largest  pth  quantile. 

2.  Dirichlet  Process.  First  we  describe  the  Dirichlet 
distribution.  Let  be  k  independent  random 
variables,  where  is  distributed  according  to  the  gamma 
distribution  with  degrees  of  freedom  and  a  common  scale 
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k 

parameter,  i  =  l,...k.  Let  2.  =  X./  (I  X.).  The 

1  1  i=l  1 

Dirichlet  distribution  with  parameter  is  given 

as  the  joint  distribution  of  Z . ,Z^.  Marginally,  Z ^  is 
distributed  according  to  the  beta  distribution  with 
parameter  (v^  v^+ .  .  . +vi_1+vi  +  i+ .  .  . . 

The  Dirichlet  process  is  defined  on  a  general  space, 
but  for  the  purpose  of  this  paper  we  consider  only  (R,B), 
where  R  denotes  the  real  line  and  B  denotes  the  a-algebra  of 
all  Borel  subsets  of  R.  Let  a(.)  be  a  finite  measure  on 
(R,B),  and  let  Q  be  a  stochastic  process  indexed  by  the 
elements  of  B.  We  say  that  Q  is  a  Dirichlet  process  with 


parameter  a,  and  write  Q  e 

D(«) , 

if 

for 

every 

finite 

measurable  partition  (B^, . . . 

'V 

of 

R. 

the 

vector 

(Q(B1) , • . . ,Q(Bm) )  is  distributed  according  to  the  Dirichlet 
distribution  with  parameter  (b(B1) , . . . ,a(Bm) ) .  Thus  Q  is  a 
random  probability  distribution  on  R,  and  Q(A)  represents 
the  probability  measure  of  A  under  Q,  for  A  e  B. 

It  is  known  that  Q  is  discrete  with  probability  1,  and 
that  if  X^,...,X  is  a  sample  from  Q  then  a  posteriori, 
n 

Q  t  D(a  +16  ),  where  6  is  a  measure  which 

i=l  X±  x 

assigns  unit  mass  to  the  single  point  x.  It  is  a  drawback 
that  Q  is  discrete  with  probability  1.  We  should  have  a 
prior  that  chooses  a  continuous  distribution  with 
probability  1.  However,  Ferguson  (1973)  points  out  that  the 
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discreteness  of  Q  does  not  limit  the  use  of  the  Dirichlet 
process  as  a  family  of  prior  distributions  in  certain 
problems,  such  as  the  estimation  of  the  quantiles. 

We  give  below,  certain  properties  of  the  Dirichlet 
process  which  will  be  used  in  the  sequel.  The  following 
result  is  due  to  Ferguson  (1973).  Let  Q  be  a  realization  of 
the  Dirichlet  process  with  parameter  a,  and  let  M  -  o(R). 
For  estimating  Q  let  the  loss  function  be  given  by 


L(Q,Q)  *  -  Q(t))2  dw(t) 


where  Q  denotes  the  estimate  of  Q, 
a  given  finite  measure  on  (R,B). 

Q  is  EQ  =  Q°,  say,  where 


Q(t)  -  Q((— ,t])  and  w  is 
Then  a  Bayes  estimator  of 


(2.1)  Q°(  t)  =  («(—  ,t])/M. 


The  distribution  Q°  is  our  prior  guess  of  Q.  If  a  sample  of 
size  n  is  drawn  from  Q,  the  Bayes  estimator  is  given  by 


Q  =  PnQ°  +  <l-Pn)F 


(2.2) 
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where  pn  =  M/(M+n)  and  F  denotes  the  empirical  distribution 
function  of  the  sample. 

Let  £  denote  the  pth  quantile  of  Q,  given  by 

Q(<—  ,£))  *  P  *  Q(  (  — ,S  1 ) 


for  0<p  <1.  Itis  known  that  £  is  uniquely  determined 
with  probability  1.  For  estimating  £  let  the  loss  function 
be  given  by 


q<5-$) 

MS,  S  )  = 

(1-q)  (S~S) 


for  £  2  £ 


for  £  <  £ 


where  £  is  an  estimate  of  £  and  q  is  a  given  number,  such 
that  0  <  q  <  1.  Any  qth  quantile  of  the  distribution  of  £ 
is  a  Bayes  estimate  of  the  realized  value  of  £,  under  the 
given  loss.  Let  b(x;a,c)  denote  the  beta  density  function 
and  let  u  =  u(p,q,M)  be  a  solution  of  the  equation. 


(2.3)  I (p;uM,  (l-u)M)  =  1  -  q 


where 
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I  ( x ;  a ,  c )  =  /*  b ( y ; a , c ) dy 

denotes  the  cdf  of  the  beta  distribution.  Then  the 

u(p,q,M)th  quantile  of  Q°  is  a  Bayes  estimate  of  £.  Given  a 
sample  of  size  n  drawn  from  Q,  a  Bayes  estimate  is  the 

u(p,q,M+n)th  quantile  of  Q,  given  by  (2.2).  Ferguson  has 
tabulated  the  values  of  u(p,q,M)  for  q  =  .05(.05).95,  p  = 
.05 (.05). 95  and  M  =  1(1)10. 

3.  Selection  rule.  Let  denote  the  pth  quantile  of 
and  let  F^^  denote  the  empirical  distribution  function  of  the 
sample  drawn  from  From  (2.2)  we  have  that 


(3.1) 


®i  = 


Pn  2 


^ 1”Pn>Fi 


is  a  Bayes  estimator  of  Qi  and  that  5i,  equal  to  the 

u(p,q,M+n)th  quantile  of  ,  is  a  Bayes  estimator  of 

where  the  function  u  =  u(p,q,M)  is  given  by  (2.3). 

Therefore,  we  select  the  population  associated  with  the 

largest  value  of  as  the  best  population.  If  two  or  more 
— 

of  the  are  tied  for  the  largest  value  we  select  one  of 
them  randomly.  We  shall  ignore  in  the  following  discussion 
the  occurrence  of  a  tie. 


8 


The  probability  of  a  correct  selection  (PCS)  for  the 
given  rule,  is  obtained  as  follows:  Let  v  =  u(p,q,M+n)  and 

y(x)  =  fn(v-pn  Q°(x))/(l-pn) ] 

where  [x]  denotes  the  smallest  integer  greater  than  or  equal 
to  x. 

Since  the  selection  rule  is  symmetric,  the  PCS  is  given 

by 

(3.2)  PCS  =  KP(5.  S  £k,  $.  S(K,  i  =  1 . k-1) 

=  K  S_m  f_m  GK“1(x,y)dG(x,y) 

where 

(3.3)  G(x,y)  =  PUA  s  x,  s  y> 

=  PfQ^x)  2  p,  Q.(y)  2  v> 

=  PtQ^x)  2  P.  F^y)  2  (v-pnQ°(y)  )/(l-pn)). 

We  have 

PfF^y)  2  (v-pnQ°(y))/(l-pn)  Q.) 

*  KQ^y);  *(y),  n+l-r<y)). 
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Therefore 


(3.4)  G(x,y)  =  J  /  I ( t ; y ( y ) .  n+l-y(y) )dH(s, t) 

sip 

where  H(s,t)  denotes  the  joint  cdf  of  s  =  Q^(x)  and  t  =  Q^y). 

Now  (Qi(x),  Qi(y)-Qi(x)/  1-Qi(y))  is  distributed 
according  to  the  Dirichlet  distribution  with  parameter 
(a(x),  o(y)-o(x),  M-a(y))  for  x  <  y,  and  (Q±(y)/ 

Qi(x)-Qi(y) ,  1-Qi(x)  is  distributed  according  to  the  same 

distribution  with  parameter  (o(y),  a(x)-a(y),  M-a(x))  for 

x  >  y.  Given  Q^y),  Q^xJ/Q^y)  is  conditionally 

distributed  according  to  the  beta  distribution  with 
parameter  (a(x),  a(y)-a(x) )  for  x  <  y,  and 

(1-Qi(x))/(1-Qi(y) )  is  conditionally  distributed  according 
to  the  beta  distribution  with  parameter  (M-a(x),  a(x)-a(y)) 

for  x  >  y.  Let  $(x,y)  =  1(0)  for  x  <(>)  1  and  let  I(«;a,b) 

=  1  for  u  i  1.  From  (3.4)  we  have 

(3.5)  G(x,y)  =  I(«;  T ( y) ,  n+l-y(y) ){ ( 1-1 (®;«(x) ,a( y) -a(x) ) ) 

«(x,y)  +  (l-*(x,y)  )Iq?w;M-a(x)  ,a(x)-a(y) )} 
b(u;  «(y),  M-a(y))dw 


for  x  *  y  and 
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(3.6)  G(y,y)  =  I(w;»(y),  n+l-r ( y ) )b(u;  a(y);  M-a(y))dw 

The  expression  for  the  PCS  given  above,  involves 
several  parameters,  namely,  k,  p,  q  and  n,  in  addition  to 
the  measure  a.  Given  k,  p,  q  and  a,  a  minimum  value  of  n 
can  be  determined  such  that  the  value  of  the  PCS  is  at  least 
as  large  as  a  specified  number  between  0  and  1. 

Through  a  minor  modification  of  the  selection  rule 
given  above,  we  can  obtain  a  procedure  for  selecting  a 
random  subset  of  the  given  populations  which  includes  the 
best  population.  The  above  result  can  be  used  to  obtain  the 
probability  of  a  correct  selection  for  that  procedure. 

For  an  illustration  of  the  given  result,  it  would  be 
convenient  to  consider  the  special  case  in  which  o 
represents  the  uniform  distribution  on  (0,1).  For  this  case 
we  have 


G(x, y)  =  I (u; [ ( n+1 ) v-y ] ,  n+1- [ (n+1 )v-y ] ) 

{(l-I(J;x,y-x)  )<Hx,  y)+( l-*(x,  y) ) 

I ( l_x' x~y ) )  b(u, y, 1-y )du,  x  *  y 

G(y,y)  =  /p  I(u; I (n+l)v-y] ,  n+1- [ (n+1 )v-y 1 ) 
b ( u ;  y,  1 - y ) du . 
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4.  Conclusion.  In  this  paper  we  have  developed  the  theory 
of  the  Dirichlet  process  for  application  to  a  non-parametric 
problem  of  ranking  and  selection.  The  given  results  are 
mainly  of  theoretical  nature.  For  practical  application  it 
would  be  necessary  to  justify  or  test  the  assumption  that 
the  underlying  distributions  have  been  generated  from  a 
Dirichlet  process.  There  are  also  questions  relating  to  the 
prior  specification  of  the  parameter  a  and  to  the  robustness 
of  the  statistical  procedures  resulting  from  the  given 
theory,  as  compared  to  certain  parametric  statistical 
procedures.  We  shall  discuss  the  application  side  of  the 
selection  problem  in  another  paper. 


12 


References 


1.  Desu,  M.M.  and  Sobel,  M.  (1971).  Nonparametric 
procedure  for  selecting  fixed-size  subsets.  Statistical 
Decision  Theory  and  Related  Topics  (Ed.  Gupta,  S.S.  and 
Yackel,  J.),  Academic  Press,  New  York. 

2.  Ferguson,  T.S.  (1974).  A  Bayesian  analysis  of  some 
nonparametric  problems.  Ann.  Statist.  (1)  209-230. 

3.  Ferguson,  T.S.  (1974).  Prior  distributions  on  spaces  of 
probability  measures.  Ann.  Statist.  (2)  615-629. 

4.  Gibbons,  Olkin  and  Sobel  (1977).  Selecting  and  Ordering 
Populations  -  A  New  Statistical  Methodology.  John 
Wiley,  New  York. 

5.  Rizvi,  M.H.  and  Saxena,  K.M.L.  (1972).  Distribution- 
free  interval  estimation  of  the  largest  a-quantile. 
Jour.  Amer.  Statist.  Assn.  (67)  196-198. 

6.  Rizvi,  M.H.  and  Sobel,  M.  (1967).  Nonparametric 
procedures  for  selecting  a  subset  containing  the 
population  with  the  largest  a-quantile.  Ann.  Math. 
Statist.  (38)  1788-1803. 

7.  Sobel,  M  (1967).  Nonparametric  procedures  for  selecting 
the  t  populations  with  the  largest  a-quantiles.  Ann. 
Math.  Statist.  (38)  1804-1816. 


UNCLASSIFIED _ 

skcuwtv  classification  op  rmt  paos  omm  «wi t«o 

REPORT  DOCUMENTATION  PAGE  iircMtcovtStn^roiw 

I.  NUNlI*  !2.  OOVT  ACCESSION  NO.  J.  NtCIPISHT'S  CAT *1.00  mumScr 

_ nr  130 _ Jap,  Ai ri~  _ 

7  TITLS  f«iP  Sutilitn)  1  1  S'  TVPt  OP  NtPONT  5i  PCNIOO  COVtNSO 


A  Bayes  Procedure  for  Selecting  Che 
Population  with  the  Largest  pth  Quantile 


7.  AUTMOHfw 


pcrponmino  oao.  hkpont  number 

Technical  Report  #368 
(.  C0NTP*iT  or  an  ant  numbcrt.) 


Khursheed  Alam 


N00014-75-C-0451 


s.  pcrpormino  oroanizatio n  mans  ano  address 
Clems on  University 
Dept,  of  Mathematical  Sciences 
Clemaon,  South  Carolina  29631 

II.  CONTROLLING  OPPICE  MAMS  ANO  AOOREJS 

office  of  Naval  Research 
Code  434 

Arlington.  Va.  22217 _ 


10.  NROORAM  CLEMENT.  PROJECT.  TASK 
AREA  A  PORK  UNIT  NUMBERS 


NR  365-049 


12.  REPORT  OATS 

July,  1981 

<s.  number  op  packs 

12 


4.  MONITORING  AOtNCY  NAME  A  AOOftCSVJf  dUimrmt  from  Controiifftf  Oitif)  1  t*.  SSCURITY  CLASS.  (•!  Mb  « 


I  01  ST  HI  SO  TJ  ON  STATEMENT  (ot  thia  Rmporl) 


Unclassified 

la.  OCCL  ASSl AfC ATtON/ DOWNS* AblNO 
SCHEOULE 


Approved  for  public  release?  distribution  unlimited. 


17.  DISTNiauTION  STATKMKNT  (»l  A*  ateirMI  «NnS  In  IIm*  20.  II  mllnrwnt  tn 


10.  SUPPLEMENTARY  NOTES 


It.  KtY  MONOS  fCwiNw  «p  IHIIM  •(*  II  nnnnnnmr  «P  iMNMilp  *p  Mae*  ( 


Ranking  and  selection,  Bayes  procedure,  Dirichlet  process. 
Population  quantile. 


ABSTRACT  rCmiwp.  m  iw  WOP  If  »T  AN*  Itmtifr  *r  **•«*  moew?  . 

“^The  Bayesian  approach  has  not  been  very  fruitful  in  treating  nonparaaetric 
statistical  problems,  due  to  the  difficulty  in  finding  mathematically 
tractable  prior  distributions  on  a  set  of  probability  measures.  The  theory 
of  the  Dirichlet  process  has  been  developed  recently.  The  process  generates 
randomly  a  family  of  probability  distributions  which  can  be  taken  as  a 
family  of  prior  distributions  for  the  Bayesian  analysis  of  some  nonpara- 
metrlc  statistical  problems-;  This  paper  deals  with  tha  problem  of  - 


TOWN 

t  JAN  72 


COITION  OP  I  NOV  •>  IS  OBSOLETE 

s/n  o loa-o  14-  ««ot 


UNCLASSIFIED 
v  SSRiSRwS  op 


oglTv  CLAtaif iC*tiqn  0 r  Thi»  PkQtmhm  Otm  l«nw<) 


"selection  a  distribution  with  the  largest  pth  quantile  value, 
from  given  distributions.  It  is  assumed  a  priori  chat 
the  givSh  distributions  have  been  generated  from  a  Dirichlet 
process.. 


